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Abstract 

In this paper we present two interesting properties of stochastic cellular au¬ 
tomata that can be helpful in analyzing the dynamical behavior of such au¬ 
tomata. The first property allows for calculating cell-wise probability distri¬ 
butions over the state set of a stochastic cellular automaton, i.e. images that 
show the average state of each cell during the evolution of the stochastic cellular 
automaton. The second property shows that stochastic cellular automata are 
equivalent to so-called stochastic mixtures of deterministic cellular automata. 
Based on this property, any stochastic cellular automaton can be decomposed 
into a set of deterministic cellular automata, each of which contributes to the 
behavior of the stochastic cellular automaton. 

Keywords: stochastic cellular automata, complexity analysis, continuous 
cellular automata, decomposition 


1. Introduction 

Cellular Automata (CAs) are often used for constructing models in a variety 
of fields of application, including chemistry, biology, medicine, physics, ecology 
and the study of socioeconomic interactions. In many of these settings. Stochas¬ 
tic CAs (SCAs) are considered due to the stochastic nature of the phenomenon 
under study or due to a lack of understanding of the exact rules driving the 
phenomenon mm- Therefore, a better understanding of the dynamics of SCAs 
is crucial. Only a few methods for dealing with the analysis of models based on 
SCAs have been developed. In many practical applications, especially in cases 
where the averaged behavior of the system is of concern, sampling methods, 
relying on extensive computer simulations, are sufficient [SIIIIS]. Techniques 
built on the mean-field theory can be used to study the long-term behavior of 
SCAs |5]. The theory of Markov chains can be applied to provide analytical 
tools for analyzing the model’s behavior [7], although in practice, due to the 
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theoretical and computational complexity of such tools, the application scope 
is limited. 

This paper is devoted to providing effective analytical tools based on de¬ 
terministic CAs for the analysis of multi-state SCAs. Although the theoretical 
foundations of the presented results are already (at least partially) known in 
the literature Ellin], so far the applications are limited. Therefore, the main 
aim of this paper is to provide a complete, formal description of the discussed 
properties in a form that is suitable for applications and that does not require a 
strong mathematical background, as well as to present examples that can mo¬ 
tivate further applications in the domain of systems modeling. The methods 
presented here are developed in the context of ID SCAs on finite lattices, but 
can be easily generalized to the case of higher dimensions and infinite spaces. 

Two main results are presented in this paper. The first one involves con¬ 
structing images that show the cell-wise probability distribution over the state 
set, at any time step. The method is based on associating an SCA with a de¬ 
terministic, Continuous CA (CCA). The second result shows the equivalence of 
SCAs and stochastic mixtures of deterministic CAs. Based on this finding, any 
SCA can be decomposed into a finite set of deterministic CAs, each of them 
contributing to the behavior of the stochastic system. An effective method for 
finding a decomposition is presented. It allows to uncover the deterministic 
component in the mixture with the highest impact on the behavior of the SCA. 

This paper is organized as follows. We start with some preliminaries and 
definitions in Section In Section we introduce the concept of CCAs and 
the formalism enabling the analysis of multi-state CAs. Section contains 
the definition of multi-state SCAs and holds the main results of this paper. 
The paper is concluded with Section discussing the experimental results that 
illustrate our results. A summary is presented in Section]^ 

2. Preliminaries 

Informally, a CA is a discrete dynamical system in which the space is subdi¬ 
vided into discrete elements, referred to as cells. At every consecutive, discrete 
time step, each cell is assigned one of N states using a deterministic rule, which 
depends only on the previous state of the considered cell and the states of its 
neighboring cells HUj. 

Formally speaking, let the state set 5 be a finite set of > 1 elements. 
Elements of the set C = {ci \ i = 1,..., M} will be referred to as cells. Ev¬ 
ery cell Ci is assigned a state s{ci,t) € 5 at each time step t S N, according 
to a local, deterministic rule. The vector s{-,t) G will be referred to as 
the configuration, and as the initial configuration when t = 0. The sequence 
(s(-, 0), s(-, 1),...) will be referred to as the space-time diagram of the CA. Eor 
technical reasons, we impose periodic boundary conditions, but our results do 
not depend on this assumption. 

The function A: satisfying, for every t gN: 

1) = A(s(-,t)), (1) 
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will be referred to as a global CA rule or simply a CA, if there exists a radius 
r e N and a function /: —>■ S satisfying: 

s(cj,t + l) = /(s(c2_-r,t),...,s(c2-|-7-,t)), (2) 

for every i and t S N. Such a function / will be referred to as a local rule. Note 
that a local rule uniquely defines the global rule, while for a given global rule, 
multiple local rules exist. Additionally, it is assumed that r M. The vector 
{ci-r, Ci-r+i ,..., Ci+r-i,Ci+r) will be referred to as the neighborhood of cell Ci 
and R = 2r + 1 will denote the neighborhood size. For the sake of simplicity, 
s{ci-r ,. ■., Ci+r, t) S will denote the state of the neighborhood of cell ci at 
time step t, and will be referred to as the neighborhood configuration of Ci at 
time step t. 

3. Continuous CAs 

There exist multiple ways of extending the definition of CAs to cover in¬ 
finite state sets. Examples of such approaches include Coupled Map Lattices 
(CMLs) [n] and so-called fuzzy CAs [laiisiiii]. In this section, we present 
Continuous CAs (CCAs), which can be seen as a generalization of the ideas 
presented in |12| . Our formalism is based on a polynomial representation of 
discrete CA rules. We start with formulating the continuous counterparts of 
binary CAs. After that we present a generalization to cover multi-state CAs. 

3.1. Binary CAs 

Binary CAs are widely studied [IlllTB], because they allow to evolve complex 
patterns and exhibit complex behavior despite their intrinsic simplicity. The 
state set of such a CA A is 5 = {0,1}. We will now formally define and 
characterize its local rule /: —>■ S. Let I = be a binary vector. We 

consider a system of equations: 

• ■ • : ~ 1 (^) 

where (s^p,..., Si^n) is a binary vector such that i = 1 -I- 
can be seen, such a system of equations is uniquely defined by the vector 1. The 
vector I will be referred to as the lookup table (LUT) of the local rule /. It is 
not difficult to check that such a system uniquely defines the function /, since 
it lists all of the possible input configurations, and maps them to corresponding 
values by components of the vector 1. 

Following |12j , we know that the function / can be expressed as a polynomial, 
which is of interest for our purposes. In order to define it, we introduce two 
auxiliary functions. We start with the function ind: {I,..., 2-^} —>■ (I, 2}^. It 
is defined in such a way that ind(f)[TO] is the m-th digit, incremented by one, 
read from left to right, of the binary representation of the integer i — I, padded 
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with ones on the left, so that it always has length R. Consequently, it holds 
that: 

R 

i = 1 + ^ (ind(i)[i? — (m — 1)] — 1) 2"*“^. (4) 

m—1 

The values of ind(i) for R = 3 and i G {1,..., 8} are shown below: 

ind(l) = (1,1,1), ind(2) = (1,1, 2), ind(3) = (1, 2,1), ind(4) = (1, 2, 2), 
ind(5) = (2,1,1), ind(6) = (2,1, 2), ind(7) = (2, 2,1), ind(8) = (2, 2, 2). 

The function ind is related to the binary representation of integers. In |12j a 
simpler formulation using the function bin, which yields the binary represen¬ 
tation of an integer, is used. The construction presented here, although a bit 
more complicated in the binary setting, allows for a smoother generalization to 
the multi-state case. 

Using the function ind, we now define the function n:5xNxN—which 
for s € S and m, z G N, is given by: 


n{s, m, i) 


s , if ind(z)[m] = 2, 
1 — s ,if ind(z)[m] = 1. 


Note that we will use vectors of states of the form (si,..., s^j) G and for 
simplicity, for any m G {!,..., i?}, we will write n{sm, i) instead of n{sm, w, i). 
Using the functions ind and n, we can write the polynomial representation of 
the local rule / as: 


2" / R 

i—l \m=l 


nis. 


(5) 


The following example shows the explicit form of this polynomial for a member 
of the family of Elementary CAs (EGAs). 


Example 1 (Elementary CAs). Binary, ID CAs with neighborhood radius 
r = 1 are commonly referred to as EGAs m- There are 256 such EGAs. 
Treating the LUT entries li as digits of an integer written in base 2, we can 
enumerate the local rules of EGAs. By convention, the binary vectors are read 
in the reverse order, i.e. (^i)f=i i® interpreted as {Ig^h, ■ ■ ■ ,h) 2 - For example, 
given the LUT I = (0,1,1,0,1, 0, 0,1) of EGA 150, and denoting the Boolean 
complement as s = 1 — s, its local rule can be written, according to Eq. ([^, as: 

/l5o(Sli S 2 , S3) = Si S2 S3 -I- Si S2 S3 -|- Si S2 S3 -I- Si S2 S3 . □ 


Using the above notation, a CCA can be defined analogously to a binary CA, 
with two notable differences. Firstly, the state set of a CCA is the unit interval, 
i.e. S = [0,1], and, secondly, the local rule /: [0,1]^ —)■ [0,1] is given by Eq. ^ 
with coefficients k G [0,1]. We will refer to such a vector (/i)f^i as a general¬ 
ized LUT. It is easy to check that this definition of a CCA is formally correct. 


4 


Indeed, the values of the function / in Eq. ([^ are guaranteed to belong to the 
unit interval if k £ [0,1] for all i and Sm G [0,1] for all m G {1,..., i?}. Note 
that this construction is directly related to the one presented in [12] , where fuzzy 
CAs are constructed as polynomials representing fuzzified logical functions. Fol¬ 
lowing the same line of reasoning, an alternative polynomial representation for 
the local rules of binary CAs is presented in [HI, which is consistent with a 
logical representation of the local rules. 

In order to introduce the formalism that is needed in the multi-state setting, 
we present a slightly modified way of representing binary CAs compared to the 
one obtained through Eq. (§. Let us assume that the state set is given by 
52 = {(1, 0), (0, 1)} C K^. Then the local rule is a function /: Sf —52 and 
can be represented as a vector function / = (/i,/ 2 ), where fj: 5^ —>■ {0,1}. 
Note that any s = (si, S 2 ) G 52 satisfies S 2 = 1 — si. Similarly, /2 = 1 — /i, so 
that the local rule / can be defined using /2 only. Let ^ G 5| be given, i.e. for 
every z = 1, ..., 2^, it holds that k = /i, 2 ) G 82 - Let (si,..., sr) G S^- We 

can define /2 with a formula similar to Eq. ([^ as: 

2« / \ 

/2(si, . . . , Sfl) = ^ k ^2 I «(Sm.2, *) 1 ■ (6) 

i=l \7n=l / 

Since Sm,i = 1 ~ Sm, 2 i we can rewrite the formula for the function n as: 


^(Sm,2; f ) 


Sm ,2 ,if ind(z)[m] = 2, 
,if ind(z)[m] = 1, 


which can be simplified to: 


and thus /2 can be written as: 


H 


/2(si, . ■ . , ^ ^ ^i,2 I ^777 I . 


(7) 


( 8 ) 


i—1 \m—l 


Having defined the function / 2 , we can prove the following fact, which gives 
a direct formula for fi. 

Lemma 1. The function fi can be expressed as: 

2 ’^ / R \ 

fl (si, . ■ . , Sr) n (9) 

2^1 \m^l / 

Due to the above, we can write a general expression for fj, j = 1, 2, as: 

2 ^ / R \ 

- j ^ ^ I '^m,ind(2)[m] 1 • (^0) 

2=1 \m=l / 
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Resorting to this vector notation, a CCA can be defined with a state set: 


Sc = {{.xi,X2) e [0,1]^ I + a ;2 = 1} C , (11) 


and a local rule f: Sf ^ Sc given by Eq. (|l0|) with G S; 


Example 2. The local rule /150 of ECA 150 presented in Example can be 
rewritten in the context of the state set ^ 2 , as /: —>■ 52 , / = (/i,/ 2 )- Note 
that for Si G S 2 , it holds that Si = (si^i,Si^ 2 )- The local rule / can be written 
as: 


/l(si, S 2 , S 3 ) — Sl ,2 S 2,2 S3,l + Si ,2 S 2 ,l S 3 , 2 + Si,i S 2,2 S 3 , 2 + Si,i S 2 ,l S3,i 
/ 2 (si, S 2 , S 3 ) = Si, 2 32,2 S 3 ,2 + Si, 2 S 2 ,l S 3 ,i + Si,i 32,2 S3,l + Si,i S 2 ,l S 3 , 2 ■ □ 


The representation given by Eq. (10) is equivalent to that given by Eq. 
but allows for an easier generalization to multi-state CAs. Therefore, we 
use it throughout the remainder of this paper. 


(§, 

will 


S.2. Multi-state CAs 


We start with generalizing the state set 52 defined in Section 3.1 to the case 
of N states. Let Sm C be the set of all base vectors of the A^-dimensional 
Euclidean space, i.e. 


5iv = {eG{0,l}^|||e||=l}. (12) 

Let the i-th state in a multi-state CA be represented by the base vector which 
has zeros everywhere, except at position where it has a one. For instance, 
if iV = 3, then 53 = {(1,0, 0), (0,1, 0), (0,0,1)}. For the sake of simplicity, in 
the remainder of this paper, we will write 5 instead of Sn-, assuming that N is 
fixed. 

We now generalize the function ind: {1,..., A^-^} —>• {1,..., N}^, defined 
above for the binary case, to enumerate the neighborhood configurations. 
Let J G {1,..., A^^} be the index identifying a neighborhood configuration. We 
assume that such a configuration consists of the following states: 


(^ind(j) [1 ]: ^ind(i) [2]; ■ ■ • : ^ind(,i) [i?] ) ■ 

We assume that ind satisfies the following equation: 

R 


i = l-f ^ (ind(i)[i?-(m-1)] - l) A^^-C 


A direct formula for calculating ind is given by: 

{i - 1 ) mod 


ind(t)[m] = 


jyR-r. 


1 . 


(13) 


(14) 


(15) 
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Essentially, ind(i) yields the positional representation of i — 1 in base N, 
padded with zeros on the left, so that it always has length R, where each of the 
digits is incremented by one. Hence, one easily finds: 


ind(l) = (1,.. 

-,1,1), 

ind(2) = (1,.. 

-,1,2), 

ind(A^) = (1,.. 

■,l,iV), 

md{N + l) = {!,.. 

-,2,1), 

= (Af,., 

..,N,N) 


Any local rule / of an A^-state, deterministic CA can be uniquely defined 
by writing down its outputs for all of the possible neighborhood configurations. 
Formally, let us consider the system of equations: 


/(Cind(i)[l]) ■ ■ ■ j 6ind(i)[i?] ) — hi (1®) 

where i G {!,...,A^^} and h G S. Similarly to the binary case, the matrix 
^ will be called the LUT of a multi-state CA. Basically, L 

is a matrix of A^^ columns and JV rows, containing only zeros and ones, in 
such a way that every column contains exactly one non-zero entry. Each of the 
matrices L G uniquely defines an A^-state CA in terms of its local rule. 

In order to define a CCA in the context of multiple states, we need to formally 
define its state set. Let Sc be defined as: 


5c = < {xi,.. .,xn) G [0,1] 


N 


N 

E 


Xi = l} C 


sJV 


(17) 


which is consistent with the definition for N = 2 given by Eq. (11). Note that 


any s G Sc satisfies Si = 1 — ■ The set Sc is commonly referred to as a 

standard {N — l)-simplex [T5] . 


Example 3. Let N = 3. The set Sc is a 2D triangle placed in the 3D Euclidean 
space, with vertices (1,0, 0), (0,1, 0) and (0,0,1). □ 


In the case of a CCA, the local rule is a vector function /: S^ —>■ 5c, given 
by / = {fi, ■ ■ ■, In), for which there exists a matrix P G S^ , referred to as 
the generalized LUT, such that for j S {1,..., A^} it holds that fj : S^ -G [0,1] 
and: 


(Sl, . ■ . , Sfi) ^ ) Pij 
i=l 


/ R 

n 

\m—l 


nd(i) [n 


(18) 


We will show that the definition given by Eq. ( [T^ is formally correct, mean¬ 
ing that the function / with components fj satisfies /: S^ —>■ 5c, which is 
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equivalent to showing that fj{si, ..., s_r) € [0,1] and ) ^r) = 1 

for any (si,... ,Sij) G S^. 

Lemma 2. For any (si,..., s_r) G 5^, it holds that 


AfH 

E 


2=1 


■ R 

n 


'®m,ind(2) [m] 


_m—l 


= 1 . 


(19) 


Proof. Let 


N^- 


0 = E 


n 


’m,ind(2) [m] 


Note that for every j G {1,..., N}^, there exists exactly one i G {1,..., N^} 
such that j — ind(i). Due to this, it holds that: 




.m—l 


By regrouping the elements of the above sum, we may write: 

■fl-i / N 

n Jtn I E/ 


0= 5: 


.m—l \2=1 


Since sr G we know that ^R,i — thus: 

■fi-i 


0= 5: 


n 

m—l 




By repeating this regrouping procedure R — 1 times, we finally get: 

N 

® = E J = 1 ■ 

i=i 


□ 


Proposition 1. For any j G {1,...,A^} and any {si,... ,sn) G S^, it holds 
that fj{si,..., Sr) G [0,1]. 

Proof. Firstly, /j(si,..., sr) > 0 is fulfilled since all of the elements in the sum 
in Eq. (18) are non-negative. Therefore, showing that fj{si,...,SR) < 1 is 
sufficient to prove the proposition. Since Pij < 1, we can write: 

fj{si,...,sR) = ( p 


/ R \ R 

'^m,ind('i)[m] j E En ^m,ind(z) [m] 
i —1 \m—l / 2=1 m—l 


= 1 . 


□ 













Proposition 2. For any (si,..., sr) G S^, it holds that: 

N 

= 1 . 

Proof. Let j G then it suffices to show that: 


1 - fj{Si, ...,Sr)='^ fk{si, ...,sr). 
This identity is proven by the following chain of equalities: 

/ R \ 


(20) 


1 ) ^r) — 1 ^ ^ Pij I '^m,ind(j)[m] j 

i—l \m—1 / 

\ ^ 

~ 1 ^ ^ j 1 ^ ^ Pik j '5m,ind(i)[m] 


2=1 




m—1 


R / R 

1 ^ ^ '®m,ind(2)[m] F ^ ^ ^ ^ Pik I '®m,ind(i) [m] 

2=1 m—1 2=1 k^j \m=l 




“ X! X! n '^m,ind(2) [m] /fe(si, • • • , Sr) . 


□ 


»=1 


\m—l 




We have shown the correctness of the CCA definition. We conclude this 
section with the observation that the local rule /: —>■ 5 of any A^-state, 

deterministic, ID CA can always be written in the form of Eq. (18), where 
(si, ■■.,sr) G S^, fj : -G {0,1} and fj{si, ■■■,sr) = 1. This shows 

that CCAs are a generalization of A^-state, deterministic CAs. 


4. Properties of SCAs 

In this section, we provide a general definition of an A-state SCA, which 
is formulated using the notation introduced in Section for multi-state CAs 
and CCAs. Subsequently, we show the nature of the relation between SCAs 
and CCAs. Finally, we present a method for decomposing an SCA into a set of 
deterministic CAs. 


4 . 1 . General construction of multi-state SCAs 

We define SCAs, the stochastic counterpart of multi-state CAs. We assume 
that the state set S is given by Eq. (12) and contains all the standard base 
vectors e^. In such a setting, the states are assigned according to a probability 
distribution, which depends on the neighborhood configuration at the previous 
time step. Therefore, SCAs are not defined through a local rule, but rather by 
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a set of conditional expressions, which for j G {1,..., TV} and A: € {1,, N^} 
can be written as: 


— 6j I Ci-i-T-, t) — ( 6 ind(fc)[l]; • ■ • ; ^ind(/c) [ 7 ?] — Pkj • (^1) 


The conditional probabilities pkj in Eq. (21) do not depend on the choice of i 


and t. Obviously, it holds that pk = (Pki, ■ ■ ■ iPun) G S,. for all k. Furthermore 
the matrix P = {pkj ) can be considered an element of . Consequently, every 
element in uniquely defines the probability distribution of some local rule. 
The matrix P will be referred to as the probabilistic lookup table (pLUT). Such 
matrices are commonly known as stochastic matrices m and are typically used 
in the analysis of Markov chains. 

Any SC A is u niquely defined by a pLUT belonging to . Since we know 
that every element of uniquely defines a CCA, we elaborate 

and a CCA 


3.2 


from Section 

on the relationship between an SCA defined by a pLUT P G 
defined by the same P considered as generalized LUT. 


4-2. Relationship between GCAs and SCAs 

In contrast to deterministic CAs, only the initial state s(',0) is known with 
certainty in the case of SCAs, while for s(-,t), t > 0, only a probability distri¬ 
bution can be calculated. Let Tr(ci,t \ Iq) denote the probability distribution 
over the set of states S for the i-th cell at time step t, given that the evolution 
started from the initial condition Iq, i.e. s(-, 0) = Iq- Whenever it will not bring 
confusion, we will omit Iq and write 7r(ci,t). Since there are N states, we need 
to specify N probabilities summing up to 1, to define 7r(ci,t). For this reason, 
let us represent Tr{ci,t) as a vector of probabilities, i.e. Tr(ci,t) G Sc, such that: 

7 r(ci,t I Io)[k] = P(^s(ci,t) = ej \ s(-,0) = / q ), (22) 


where j = 1 ,..., N. 

When using SCAs in practical problem settings, one might be interested in 
computing the values of 7r(ci, t) for any t. Normally, this is achieved by means of 
sampling methods I3I1I5], i.e. multiple space-time diagrams are generated from 
the same initial condition and probability distributions are estimated, based on 
the frequencies of reaching any of the states in a given cell. Unfortunately, such 
an approach has serious drawbacks, especially in terms of the computational 
burden it brings along. Yet, one can calculate 7r(ci, t) much faster by relying on 
CCAs. 

Proposition 3. Let P G be the pLUT of an SCA, and let f: — >■ Sc be 

a function defined according to Eq. ( |I^ using P as generalized LUT, then: 

7r(ci, t + 1) = f(7r(ci-r, t),..., 7r(ci+r, t)) , (23) 


for any i and t. 
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Proof. Let k £ and j £ iV}, then pkj is the probabil¬ 

ity of reaching state ej if the neighborhood configuration was known to be 
(eind(fc)[i]j • • •) eind(fe)[i?])j which will be referred to as the fc-th neighborhood. 
Assuming that 7 r(-,t) is known, the probability of cells (ci_r, ■ ■ •, being in 
the fc-th neighborhood can be calculated with the following formula: 

P^s(Ci_y, . . . , Ci-i-r, t) = (eind(fc)[l] 5 ■ ■ ■ ; ^ind(fc) [i?] 

R 

= n(24) 

m—l 

Due to the Total Probability Theorem, the probability of reaching state ej at 
time step t -I- 1 is expressed as: 


-I- 1) = Cj'j = 

^ ( P^' 5 (Ci, t -f 1 ) 6 j I s(Cj_7., • ■ - , Ci-i-T-, t) (Cind(A;)[l]; • ■ ■ ; ^ind(fc)[K] ^ 

P^s(c2_7., . . . , Ci+7., t) = (eind(/c)[l] 5 ■ • ■ 5 ^ind(fc)[i?] ■ (2d) 


k=l 


Substituting Eqs. (21) and (24) in Eq. (25), we get: 

/ R 


»(s(c„t + l) = e,) =Y,Pkj n r+m— 1; t) [ind(fc)[m]] j . (26) 


k —1 \m—l 


Taking into account the definition of the function / in Eq. (18), we can rewrite 
Eq. (26) as: 

p(^s(ci,t -I- 1) = Cj'j = fj{TT{c^-r,t), . . . ,7r(Cj+r,t)) . 

Since 7r(ci, t + l)[j] = P(s(ci, t -I- 1) = ej) , the following is proven: 

7T{Ci, t -b 1) = /(7r(Ci_r, t),..., Tr{Ci+r, t)) . □ 


As shown above, the construction of CCAs is directly connected to the Total 
Probability Theorem, and thus it enables us to better understand the evolution 
of cell-wise probability distributions of SCAs. It is important to highlight that 
the evolution of those distributions is deterministic in SCAs, although the dy¬ 
namical system itself is stochastic. The finding above agrees with [14], where a 
similar result in the case of binary CAs was shown. Yet, this paper was confined 
to study the asymptotic behavior of deterministic CAs. Interestingly, in an 
SCA is defined as a discrete-time deterministic dynamical system acting on 
the set of probability measures on the set of all configurations. Although the 
definition presented there is much more abstract, and suits mostly theoretical 
applications, it reassembles the main ideas of CCAs. 
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4-3. Decomposition of SCAs 

In the previous subsection, we have shown that CCAs are directly related 
to SCAs, since the evolution of a CCA is equivalent to the evolution of the 
cell-wise probability distributions of an SCA. Here, we uncover the relationship 
between SCAs and deterministic, fV-state CAs, and discuss the possibility of 
decomposing an SCA into deterministic CAs. Let us start with an introductory 
example motivating our general construction. 

Example 4 (a—asynchronous CAs). Classically, states in deterministic CAs 
are updated synchronously, i.e. a new state is assigned to all cells simultaneously 
at every time step according to the local rule. Yet, different approaches of break¬ 
ing the synchronicity of CAs have been proposed m- Interestingly, the choice 
of the update scheme, which defines the order or timing of cell state updates, 
has very important repercussions on the dynamical properties of CAs [22] • Here 
we focus on so-called a-asynchronous CAs (a-AC As) [23] . 

Any a-ACA is defined by its deterministic counterpart A and a probability a, 
called the synchrony rate, which controls whether or not its cells are updated. 
More precisely, a is the probability of applying the local rule / of A, i.e.: 

( ^ , IN _ J/(s(ci-r-, • ■ ■ ,Ci+r-,t)) , with probability a, 

|s(ci,t) , otherwise. 

Note that if a = 0, such a system remains at its initial configuration, whereas 
the system is equivalent to CA A if a = 1. □ 

The essential property of the construction presented above is that an a- 
ACA can also be seen as a synchronous, stochastic system in which the local 
rule of A is selected with probability a, while the identity rule is selected with 
probability 1 — a. Hence, we may say that the rule of a CA A is stochastically 
mixed with the identity rule. 

The idea behind a-ACA can be easily extended to the mixing oi q > 2 
deterministic rules. Consider synchronous, deterministic CAs Ai,...,Aq and 
probabilities ai,..., a^ that satisfy Yli=i ~ 1- Then, we define an SCA A 
in which the state of every cell at every consecutive time step is obtained by 
evaluating one of the local rules, corresponding to one of the A^. The rule 
is chosen randomly and independently for every cell at each time step with 
the selection probability for CA Ai being at. Such systems, referred to as 
stochastic mixtures of deterministic CAs in the remainder of this paper, hold 
very interesting properties that are currently also investigated by others |24] . 
For technical reasons, we assume that the local rules of Ai,..., A^ are defined 
with a common radius r > 0. Since each local rule with radius r' < r can 
be represented equivalently as a local rule with radius r, we are not loosing 
generality with this assumption. 

The first observation is a direct consequence of the definition presented 
above. 
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Fact 1. A ssume that r >Q is the radius of the neighborhood of the local rules of 
automata Ai,..., Aq. Let R = 2 r + 1 and suppose that Li G is the LUT of 
CA Ai, for i = 1,... ,q. Then the pLUT P G of the stochastic mixture A 
satisfies P = J2i=i oaLi- 

Sums of the form J2i=i where ai € [0,1] and J2i=i c«i = 1 are com¬ 
monly referred to as convex combinations, which allows us to rephrase Fact 
as: the pLUT of a stochastic mixture of deterministic CAs is the corresponding 
convex combination of the LUTs of the mixed CAs. 

Having defined the concept of a stochastic mixture, we are interested in 
characterizing this class of SCAs. Interestingly, we are able to show that in 
general there are no characteristics that distinguish stochastic mixtures from 
other SCAs, since any SCA can be represented as a stochastic mixture. This 
fact is not surprising if we envisage a stochastic mixture of deterministic CAs 
as a convex combination of LUTs of deterministic CAs. We can rely on the 
classical Krein-Milman theorem from convex set theory [^. It states that a 
compact convex set is the convex hull of its extreme points. In our context, the 
set containing the LUTs of deterministic CAs is the set of extreme points 
of , which is convex and compact. The convex hull of is the set of all 
possible convex combinations of elements from , which is precisely the set 
of all stochastic mixtures of deterministic CAs. Due to this, we can state the 
following theorem. 

Theorem 1. Any SCA can be represented as a stochastic mixture of a finite 
number of deterministic CAs. 

The decomposition of a stochastic mixture into a convex combination de¬ 
scribed above is not unique, therefore many methods for constructing the de¬ 
composition can be formulated. In the proof of Proposition]^ we introduce one 
possible algorithm of which it will be shown in Proposition that it uncovers 
the most influential component of the stochastic mixture. 

Proposition 4. Let P G . Then there exists an integer n* such that for 
i = 1, 2,..., n*, we can define coefficients ai G [0,1], X)r=i matrices 

L* G , that satisfy: 

n* 

P = Y,a,L\ (27) 

i=l 


Proof. We start by defining auxiliary matrices P'", for m > 0. Let: 


P , if TO = 0, 

,ifTO>0, 


where, for to > 0 : 


am = min max Pff ^, 
i j 


(28) 


(29) 
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(30) 


and L™ = {L"^) such that: 


T m _ 

^ij — 



for j = min |j | -P™ ^ \ for any / € {1,..., tV}| , 

otherwise. 


Note that, for any m and i, it holds that L™ = 1 for a single value of j. 
Consequently L™ e . 

Firstly, note that ai is one of the elements of = P, so ai € [0,1]. 

Additionally, it holds that ai > 0, since if ai = 0, then for some i, we would 
have maxj Pij = 0, which contradicts the fact that Pij = 1 for any i. 

Secondly, note that for any i and j, it holds that > 0. We will show 
next that the assumption PJ^~^ > 0 leads to P^ > 0. Indeed, we know that 
e - am}- If PIj = then obviously > 0, while 

if Pj™ = PiY~^ — am, it holds that P}J~^ > P™~^ for any I, meaning that 
P™“^ = max; P™~^- On the other hand, since am = min; max; PJY~^, it holds 
that P™~^ > am- Since it was assumed that PP~^ > 0, we see that PJ^ > 0 as 

•'J •'J ‘ •'J 

well. 

Thirdly, following the argument above, we can easily show that for every i, 
j and m, it holds that P;™ < Pij~^- This means that for every i, it holds that 
maxj P[j < maxj PP~^, and thus am+i < am- Since ai €]0,1], and am > 0 
for any m, we see that am € [0,1]. 

Note that for any i, it holds that 'Y2j ^ij = ~ Expanding 

this recursively, yields Ylj Pij = iJ2j Pij) ~ X)™ i o:i- From the definition of a 
stochastic matrix, we know that J^j Pij = such that J^j Pij = 1 ~ X)™ i 
which means that the column sums in matrix P™ are equal. Since for all m we 
have shown that P™ > 0, we know that for any to, it holds that I —X)™ i — f*) 
therefore YllLi ^ 1- 

We further note that am = 0, for some to, if and only if contains one 

zero column. Since we have shown that the column sums in P™ are equal for 
any to, we know that am = 0 if and only if p™“i = 0. Therefore, am > 0 if 
and only if ^ 0, which implies that in each of the columns of P™~^, there 

is at least one non-zero entry. 

Let z{m) = #{(i, j) | P™ = 0} denote the number of zeros in matrix P™. 
Note that if P™ ^ 0, then z{m) < z{m -I- I), which follows from the fact that 
if P™ 7 ^ 0 , there exists P™ = am+i > 0, and thus for at least one couple (i,j), 
we know that PP^^ = Pij — am+i = 0. Since z{m) cannot be greater than the 
total number of elements in P™, it is not possible that z(m) < z{m -I- I) for 
all TO. As such, we know that there exist n* for which P" = 0 and P" ^ 0, 
and thus Y2?=i = 1- Using the definition of P" multiple times, we get: 


n* 

0 = P"* = P"*-i P"* = ... = P° -ai pi - ... -a„* L”* = P-^ a, L\ 

i=l 


such that P = X)r=i F*. 


□ 
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To illustrate the construction presented in the proof of Proposition every 
step of the algorithm is visualized in the following example. 


Example 5. Although Proposition deals with stochastic matrices with di¬ 
mensions corresponding to pLUTs, it is obvious that the same method can be 
used to decompose a stochastic matrix of arbitrary size. For simplicity, we apply 
the method to an exemplary 4x3 stochastic matrix P given by: 



0.6 

0 

0.2 

0.4 

p = 

0.3 

1 

0.3 

0.4 


0.1 

0 

0.5 

0.2 


According to Eqs. ( 

28 

and ( 

301, we 

build matrices 

and 

Lb 




0.6 

0 0.2 

0.4 


'1 

0 

0 

l' 

II 

O 

= 

0.3 

1 0.3 

0.4 

, L^ = 

0 

1 

0 

0 



0.1 

0 0.5 

0.2 


0 

0 

1 

0 


Note that we highlighted the selected maximal elements in each of the columns 
using boldface font. The boldface entries in correspond to the positions of 
the ones in matrix L^. Note that in the last column of P^, there is no unique 
maximal element. Yet, as given by Eq. (301, we pick the first of the entries 
from top to bottom. Picking the other possibility would affect the subsequent 
matrices L™ and result in a different decomposition, but the coefficients am 
would not change. 

Following Eq. (291, we find that ai = 0.4. We proceed and calculate P^ 
and 



' 0.2 

0 

0.2 

0 ■ 



0 

0 

0 

0 ‘ 

pi = - Oi pi = 

0.3 

0.6 

0.3 

0.4 


L 2 = 

1 

1 

1 

1 


0.1 

0 

0.1 

0.2 



0 

0 

0 

0 

and from this we see that 

02 = 

0.3. 

We continue the procedure: 




0.2 

0 

0.2 

0 ■ 



'l 

0 

1 

0 ‘ 

P 2 = pi - 02 = 

0 

0.3 

0 

0.1 


= 

0 

1 

0 

0 


0.1 

0 

0.1 

0.2 



_0 

0 

0 

1 

and 03 = 0.2. We proceed 

one 

more 

step and get: 







■ 0 

0 

0 

0 ■ 



0 

0 

0 

o' 

P^ = P^ - as = 

0 

0.1 

0 

0.1 


L^ = 

0 

1 

0 

1 


0.1 

0 

0.1 

0 



1 

0 

1 

0 


and 04 = 0.1. This is the final step, since: P^ = P^ — a^L'^ = 0. Therefore, 
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the decomposition can be written as: 



'l 

0 

0 

1' 


'0 

0 

0 

o' 

P = 0.4 X 

0 

1 

0 

0 

-l- 0.3 X 

1 

1 

1 

1 


_0 

0 

1 

0 _ 


_0 

0 

0 

0 _ 


'l 

0 

1 

o' 


'0 

0 

0 

o' 

+0.2 X 

0 

1 

0 

0 

+ 0.1 X 

0 

1 

0 

1 


0 

0 

0 

1 


1 

0 

1 

0 


As mentioned earlier, the construction presented in the proof of Proposi¬ 
tion]^ is one of many possibilities of decomposing a pLUT, but in the following 
proposition we will show that it enables us to capture the element of the mixture 
with the highest possible coefficient. 

Proposition 5. Let P G 5^^, p = 1,... ,pm ax 7 Q ■ j Qmax ? dTld let 

ctp, I3q G [0,1], LP,M‘^ G be such that — 1- Moreover, 

let P = = E, Pq M‘>, and let ap, he defined as in the proof of 

Proposition^ It then holds that max^ Pq < ai = maxp ap. 

Proof. Let i and j be such that Pij = ai. Then for any k, it holds that 
Pik < cti. Let q G {!,.. ■ ,qmax} and let I be such that = 1. Since 
P = Y^qPqM'i, we know that Pq < Pu, and thus Pq < Pu < ai. Therefore, 
for any q G {1,..., qmax}-, h holds that Pq < ai and thus max^ Pq < ai. □ 

The informal meaning of the above proposition is that the presented ap¬ 
proach to decompose an SCA uncovers the deterministic rule that has the high¬ 
est probability of being executed, and therefore is likely to have the highest 
influence on the behavior of the system. The exact relations between the dy¬ 
namical behavior of SCAs and the dynamical characteristics of the components 
of stochastic mixtures are still under investigation. 

The experiments in the following section underline that in some cases those 
CAs that have the highest probability of application indeed greatly influence 
the behavior of the SCA. Yet, in some cases, those with a very small probability 
of application can also play an important role. Therefore, at this stage we do 
not claim that the overall dynamics of the SCA is always determined by the 
component of the mixture with the highest probability of application. 

The decomposition of an SCA as a stochastic mixture might also find its 
use in SCA-based modeling. Indeed, in some cases it might be easier to express 
the model in terms of deterministic components having a meaning in the lan¬ 
guage relevant to the modeling task. Due to the presented equivalence, we are 
guaranteed that such a practice will not limit the modeling potential. 

5. Experiments 

5.1. Analysis of a-asynchronous EGAs 

Through the following experiment, we will analyze the behavior of a-asyn¬ 
chronous ECAs. For a given rule, the cell-wise distance between the space- 
time diagram of the deterministic CA and the space-time diagram of a CCA 
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representation of an a-AC A variant, for a ranging from 0 to 1, was measured. 
More formally, if A is an EGA rule, and A„ is the CCA representation of the 
a-asynchronous variant of A for a € [0,1], we measured the distance D{a) 
between space-time diagrams. For a random initial condition Iq G {0,1}^, 
where M > 0 denotes the number of cells and T the number of time steps, the 
distance D is dehned as: 

Di.cx) = ^jlU\h)-Al{I,)\\, (31) 

t=i 


where A*{Io) denotes the result of applying the global rule A t times to input Jq. 
In this experiment, we choose M = T = 69. 

Unsurprisingly, it turned out that the most pronounced discrepancies be¬ 
tween the parallel evolutions are observed when a approaches 1. For that rea¬ 
son, we restrict the further discussion to a € [0.9,1]. We classified the EGAs 
according to the behavior of the function D (the contents of the classes is shown 
in Table [^: 

• Class I: D{a) is almost 0 , for all a G [0.9, 1 ] (Figure [Ta]), 

• Class II: D{a) smoothly decreases towards 0, as a increases (Figure [Tb|, 


Class Ilia: there is a very sudden drop of D{a) as a approaches 1 (Fig¬ 
ure 


Ic), 


• Class Illb: there is a very sudden drop of D{a) as a approaches 1, and 
the behavior is not monotonic (Figure [Td]). 


The interpretation of the classes is as follows. Rules belonging to Class I are 
resistant to a-asynchronicity, which means that introducing the asynchronic¬ 
ity aspect does not affect the behavior, or that its impact is negligible. Rules 
belonging to Class II are affected by the asynchronicity, but the impact on the 
final behavior is proportional to the synchrony rate a, i.e. the behavior steadily 
approaches the deterministic case as a approaches one. Rules belonging to 
Classes Ilia and Illb are sensitive to a-asynchronicity, meaning that the behav¬ 
ior of the system changes drastically as soon as a is smaller than I. Class Illb 
can be distinguished from Class Ilia, by the noisy behavior of D{a) for a close 
to one. The cause of the differences is not yet uncovered. 

Classes Ilia and Illb illustrate the case where the decomposition as a stochas¬ 
tic mixture does not suffice to understand the dynamics of the SCA. Indeed, 
for large a, the behavior of the SCA and that of the deterministic CA with the 
highest probability of application is quite different. 


5.2. Stochastic density classifiers 

We consider the SCA defined by the local rule C 3 introduced in [lillT]. 
This rule is defined by the pLUT shown in Table Note that this pLUT uses 
the standard notation for its state set S = {0,1}, and its entries constitute the 
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0.9 0.91 0.92 0.93 0.94 0.95 0.96 0.97 0.98 0.99 


(a) Class I (EGA 40) 



(b) Class II (EGA 42) 


0.9 0.91 0.92 0.93 0.94 0.95 0.96 0.97 0.98 0.99 


(c) Class Ilia (EGA 38) 



(d) Class Illb (EGA 43) 


Figure 1: Plots of D{a) for representatives of each of the classes defined in Table 

0 
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(a) Averaged 


(b) Sample #1 


(c) Sample #2 


(d) Failing sample 


Figure 2: Space time diagrams of C 3 with rj = 0.1. 


probabilities of reaching state 1. It was shown both analytically and experimen¬ 
tally that this rule is a stochastic solution for the classical density classification 
problem (DCP) |26j with arbitrary precision, i.e. by varying the parameter 77 , 
we can achieve any probability p < 1 of successful classification. The DCP was 
introduced in [551 US] • 


( 1 , 1 , 1 ) 

( 1 , 1 , 0 ) 

( 1 , 0 , 1 ) 

( 1 , 0 , 0 ) 

o' 

( 0 , 1 , 0 ) 

( 0 , 0 , 1 ) 

( 0 , 0 , 0 ) 

I 

V 

I 

1 — t] 

I 

0 

0 

0 


Table 2: pLUT of local rule C 3 [551157] . 

Using the CCA representation, we can visualize the average behavior of C 3 . 
In Figure the first 80 rows of the averaged space-time diagram (time goes 
from top to bottom) obtained using the CCA representation of C 3 with rj = 0.1 
are shown, together with three typical space-time diagrams obtained by direct 
evaluation of the SCA rule C 3 . All images were obtained using the same initial 
condition involving 29 cells out of which 16 were black (state I). As can be 
inferred from Figure [^ two samples lead to behavior that is similar to the one 
displayed by the average space-time diagram, while the one depicted in Figure [2d| 
behaves differently, and leads to a wrong classification. Since the probability of 
obtaining a correct classification for the initial condition at stake was estimated 
to be 90.1% on the basis of 10000 repetitions, the impact of the erroneous cases 
on the averaged image is hardly visible. 

One of the most interesting questions within the analysis of DCP solutions 
relates to the expected time of convergence towards the outcome of the classifi- 
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2000 



grid size 


(a) CCA 



grid size 


(b) SCA 


Figure 3: Expected convergence time of rule C 3 for different values of rj. 


cation and how this relates to the number of cells. Figure [^depicts the average 
convergence time calculated over an ensemble of 5000 random initial conditions 
for different numbers of cells and values of rj, both for the CCA and the SCA 
representation. The initial conditions were generated as follows. For every ini¬ 
tial condition independently, a probability p was selected randomly, and then 
the initial states were selected with p being the probability of selecting state 0 
at each cell independently. Such a selection procedure assures that each of the 
possible densities has the same probability in the ensemble of initial conditions, 
and thus we can evaluate the classifier across a diverse set of densities. 

In the case of the CCA representation, we cannot expect reaching a truly 
homogeneous, global state. Therefore we evolved it until the maximum, absolute 
difference between the states of any two cells was lower than 0.001. Then, we 
verified whether the average of the states was closer to 1 or 0. In the case of the 
SCA, for each of the 5000 initial conditions, 100 simulations were performed, 
and the results were averaged. 

The results for small grid sizes (smaller than 200) are similar for both repre¬ 
sentations and agree with the findings presented in US mi- Yet, the computing 
time when using the CCA representation is significantly lower than in the case 
of SCAs. This indicates that the CCA representation of SCAs proposed in this 
paper might enable a substantial reduction of the required computing time. 
Moreover, using the CCA representation, we are able to get more insight into 
the behavior of the dynamical system by plotting the evolution of the density 
over time. Figure shows the results of such an experiment. For the sake of 
clarity, these results are based on an ensemble of 400 initial conditions (69 cells) 
out of which 200 had a density greater than 0.5 (green), while the other 200 had 
a density smaller than 0.5 (red). The ensemble of initial configurations was the 
same for all values of f]. Note that the range of the horizontal axis differs across 
plots. From Figure we can infer that the time and quality of classification 
increases with decreasing 77 . Due to the computational costs involved in evolv¬ 
ing and averaging SCAs directly, drawing and analyzing such plots has become 
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possible only due to the introduction of CCAs. 

5.3. Totalistic SC As 

In this section, we analyze a specific class of totalistic SCAs. Although most 
likely being of no practical use, it holds many similarities with more complex 
CAs and SCAs that are often resorted to in modelling. Totalistic CAs are gen¬ 
erally known for their ability to mimic real-world phenomena [niiiniisni- Given 
their importance and straightforward formulation, we have chosen this class to 
serve as an example for our analysis. The focus is on the illustration of the tools 
introduced in this paper, rather than to unveil new properties of some complex 
CA models. 

We consider the class of ID, binary SCAs with a unit neighborhood radius, 
that satisfy the following conditions: 

P(s(Ci,t-|- 1 ) = 62 I s{Ci_i,Ci,Ci+i,t) = (62,61,61)) = 

P(s(ci,t-M) = 62 I s{ci_i,Ci,Ci+i,t) = (61,62,61)) = 

¥{s(Ci,t+ 1 ) = 62 I s{Ci-i,Ci,Ci+i,t) = (61,61,62)) =Pl, 

¥{s(ci,t + 1 ) = 62 I s{ci-i,Ci,Ci+i,t) = (61,62,62)) = 

P(s(ci,t-|- 1 ) = 62 I s{Ci-i,Ci,Ci+i,t) = (62,61,62)) = 

P(s(Ci,t-|- 1 ) = 62 I s{Ci-i,Ci,Ci+i,t) = (62,62,61)) =P 2 , 

where ei and 62 are the base vectors of the Euclidean space K^. Such SCAs 
are commonly referred to as totalistic SCAs. We consider the subclass of such 
SCAs satisfying: 

P(s(Ci, t -I- 1 ) = 62 I s(Ci_i, Ci, Ci+i, t) = (62, 62,62)) = 1 , 

P(s(ci, t -I- 1 ) = 61 I s(ci_i, Ci, Ci+i, t) = (61,61, ei)) = 1 . 

In this paper, we will refer to this subclass as totalistic SCAs. By applying 
the decomposition algorithm given by Proposition]^ we can prove the following 
fact, revealing their structure. 

Fact 2. Any totalistic SCA can be written as a stochastic mixture of the fol¬ 
lowing ECAs: 128, 150, 232, 254, where up to three of those rules are applied 
with non-zero probability. 

We can determine the regions in the (pi,p 2 )“plaiie where a given determin¬ 
istic CA has the highest probability of application, i.e. where it is dominant. 
These regions are given by: 

• ECA 128 is dominant, if pi < 0.5 and p 2 < 0.5, 

• ECA 150 is dominant, if pi < 0.5 and P 2 > 0.5, 

• ECA 232 is dominant, if pi > 0.5 and p 2 < 0.5, 
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0 2000 4000 6000 8000 10000 12000 14000 

time step 

(e) ri = 0.001 


Figure 4: Density evolution over time for the CCA representation of SCA C 3 
for 200 initial conditions with density greater than 0.5 (green) and 200 other 
initial conditions with density smaller than 0.5 (red) and different values rj. 


22 















0 0.2 0.4 0.6 0.8 1 0 0.2 0.4 0.6 0.8 1 

Pi Pi 


(a) ai5o (b) 0232 

Figure 5: Dependence of the application probabilities: ai 5 o of EGA 150 and 
Q !232 of EGA 232 in the totalistic SGA, on the values of the parameters pi and 
P2- 


• EGA 254 is dominant, if pi > 0.5 and p 2 > 0.5. 

Note that if pi = 0.5 or p 2 = 0.5, there is no unique, dominant rule, therefore 
we have omitted those cases in the description above. 

Let ai 28 , Qfiso, 0232 , 0:254 denote the probabilities of applying EGAs 128, 
150, 232, 254, respectively. Figure depicts the dependence of 0150 and 0232 
on Pi and P 2 - We have omitted the remaining two images for ai 28 and 0254 , 
since they do not yield additional information. 

The dynamical characteristics of the four EGAs that compose the totalistic 
SGAs differ significantly. Rules 128 and 254 are simple - they belong to Glass I 
according to Wolfram’s classification [16) . and their maximum Lyapunov expo¬ 
nents (MLEs) [3T] equals — 00 , which means that any changes to their initial 
configuration do not influence their long-term behavior. Rule 232 belongs to 
complexity Glass II, and its MLE is positive, but very close to zero. In contrast, 
rule 150 is a Glass III rule, and its MLE is the highest among all EGAs, which 
highlights the fact that this rule is sensitive to the smallest perturbation of its 
initial configuration. Since rules 150 and 232 are relatively more complex than 
rules 128 and 254, we might expect a distinct behavior of the stochastic mixture 
in regions where the former are dominant. 

In order to verify this, we set up an experiment involving a random ini¬ 
tial condition of M = 49 cells which was evolved 100 times for T = 49 time 
steps. Such a procedure, for the same initial condition, was repeated for mul¬ 
tiple different choices of {pi,P 2 ) using a 101 x 101 regular grid, resulting in a 
set of space-time diagrams. The set of such space-time diagrams is denoted 
as X{pi,p 2 ). Let A(pi,p 2 ) = {dist(/, J) | I ^ J,I,J e I{pi,p 2 )} denote 
the set of all pair-wise Hamming distances between space-time diagrams, and 
At(pi,P 2 ) = {dist(/[T], J[T]) \ I J] I, J G 2{pi,p2)} denote the set of all 
pair-wise Hamming distances between the final configurations in the space-time 
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diagrams. The results for the minimum, mean and maximum of A(pi,p 2 ) and 
At{pi,P 2 ) are shown in Figure]^ 

As can be seen from the charts, the distance is especially high in those 
regions where the application probability ai 5 o is high, i.e. where pi > 0.5 and 
P 2 < 0.5, and near the line P 2 = —pi in the quarter pi < 0.5 and P 2 > 0.5. The 
first region is the one where EGA 150 is dominant, while EGA 232 is dominant 
in the second one, but with EGA 150 having a substantially high application 
probability as well. Although the Hamming distance is just a simple indication 
of complexity, we already see a strong influence of the EGA 150 component. 

The findings presented above suggest that analyzing the components of a 
stochastic mixture decomposition of an SGA unveils information on the dy¬ 
namics of the SGA. In contrast to the a-AG As in Glasses Ilia and Illb (c/. 
Table [^, here we encounter a class of SGA for which uncovering the compo¬ 
nents of the stochastic mixture with relatively high probability of application 
gives additional insight into the dynamical properties of the SGA. 

6 . Summary 

In this paper, two basic, yet important properties of SGAs were discussed. 
We have shown, both theoretically and practically, that SGAs can be effectively 
analyzed in the context of deterministic GAs and GGAs. The stochastic mixture 
representation of SGAs allows to understand the underlying dynamics, while 
the GGA representation allows to quickly uncover the average behavior of the 
system. Further research is undertaken to expand the application scope of the 
presented methods. 
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